SEQUENCE LISTING 



<110> Strathmann AG & Co.

<120> Virus-Vaccine 

<130> P057760 

<140> 
<141> 

<150> 199 07 485.2 
<151> 1999-02-12 

<160> 27 

<170> PatentIn Ver. 2.1

<210> 1 
<211> 9709 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 1 

tggaagggct aatttggtcc caaaaaagac aagagatcct tgatctgtgg atctaccaca 60 

cacaaggcta cttccctgat tggcagaact acacaccagg gccagggatc agatatccac 120

tgacctttgg atggtgcttc aagttagtac cagttgaacc agagcaagta gaagaggcca 180 

aataaggaga gaagaacagc ttgttacacc ctatgagcca gcatgggatg gaggacccgg 240 

agggagaagt attagtgtgg aagtttgaca gcctcctagc atttcgtcac atggcccgag 300 

agctgcatcc ggagtactac aaagactgct gacatcgagc tttctacaag ggactttccg 360

ctggggactt tccagggagg tgtggcctgg gcgggactgg ggagtggcga gccctcagat 420 

gctacatata agcagctgct ttttgcctgt actgggtctc tctggttaga ccagatctga 480 

gcctgggagc tctctggcta actagggaac ccactgctta agcctcaata aagcttgcct 540 

tgagtgctca aagtagtgtg tgcccgtctg ttgtgtgact ctggtaacta gagatccctc 600 

agaccctttt agtcagtgtg gaaaatctct agcagtggcg cccgaacagg gacttgaaag 660 

cgaaagtaaa gccagaggag atctctcgac gcaggactcg gcttgctgaa gcgcgcacgg 720

caagaggcga ggggcggcga ctggtgagta cgccaaaaat tttgactagc ggaggctaga 780 

aggagagaga tgggtgcgag agcgtcggta ttaagcgggg gagaattaga taaatgggaa 840 

aaaattcggt taaggccagg gggaaagaaa caatataaac taaaacatat agtatgggca 900 

agcagggagc tagaacgatt cgcagttaat cctggccttt tagagacatc agaaggctgt 960 

agacaaatac tgggacagct acaaccatcc cttcagacag gatcagaaga acttagatca 1020

ttatataata caatagcagt cctctattgt gtgcatcaaa ggatagatgt aaaagacacc 1080 

aaggaagcct tagataagat agaggaagag caaaacaaaa gtaagaaaaa ggcacagcaa 1140 

gcagcagctg acacaggaaa caacagccag gtcagccaaa attaccctat agtgcagaac 1200 

ctccaggggc aaatggtaca tcaggccata tcacctagaa ctttaaatgc atgggtaaaa 1260 

gtagtagaag agaaggcttt cagcccagaa gtaataccca tgttttcagc attatcagaa 1320

ggagccaccc cacaagattt aaataccatg ctaaacacag tggggggaca tcaagcagcc 1380 

atgcaaatgt taaaagagac catcaatgag gaagctgcag aatgggatag attgcatcca 1440 

gtgcatgcag ggcctattgc accaggccag atgagagaac caaggggaag tgacatagca 1500 

ggaactacta gtacccttca ggaacaaata ggatggatga cacataatcc acctatccca 1560 

gtaggagaaa tctataaaag atggataatc ctgggattaa ataaaatagt aagaatgtat 1620 

agccctacca gcattctgga cataagacaa ggaccaaagg aaccctttag agactatgta 1680 

gaccgattct ataaaactct aagagccgag caagcttcac aagaggtaaa aaattggatg 1740 

acagaaacct tgttggtcca aaatgcgaac ccagattgta agactatttt aaaagcattg 1800 

ggaccaggag cgacactaga agaaatgatg acagcatgtc agggagtggg gggacccggc 1860 

cataaagcaa gagttttggc tgaagcaatg agccaagtaa caaatccagc taccataatg 1920 




atacagaaag gcaattttag gaaccaaaga aagactgtta agtgtttcaa ttgtggcaaa 1980
gaagggcaca tagccaaaaa ttgcagggcc cctaggaaaa agggctgttg gaaatgtgga 2040
aaggaaggac accaaatgaa agattgtact gagagacagg ctaatttttt agggaagatc 2100
tggccttccc acaagggaag gccagggaat tttcttcaga gcagaccaga gccaacagcc 2160
ccaccagaag agagcttcag gtttggggaa gagacaacaa ctccctctca gaagcaggag 2220
ccgatagaca aggaactgta tcctttagct tccctcagat cactctttgg cagcgacccc 2280
tcgtcacaat aaagataggg gggcaattaa aggaagctct attagataca ggagcagatg 2340
atacagtatt agaagaaatg aatttgccag gaagatggaa accaaaaatg atagggggaa 2400
ttggaggttt tatcaaagta ggacagtatg atcagatact catagaaatc tgcggacata 2460
aagctatagg tacagtatta gtaggaccta cacctgtcaa cataattgga agaaatctgt 2520
tgactcagat tggctgcact ttaaattttc ccattagtcc tattgagact gtaccagtaa 2580
aattaaagcc aggaatggat ggcccaaaag ttaaacaatg gccattgaca gaagaaaaaa 2640
taaaagcatt agtagaaatt tgtacagaaa tggaaaagga aggaaaaatt tcaaaaattg 2700
ggcctgaaaa tccatacaat actccagtat ttgccataaa gaaaaaagac agtactaaat 2760
ggagaaaatt agtagatttc agagaactta ataagagaac tcaagatttc tgggaagttc 2820
aattaggaat accacatcct gcagggttaa aacagaaaaa atcagtaaca gtactggatg 2880
tgggcgatgc atatttttca gttcccttag ataaagactt caggaagtat actgcattta 2940
ccatacctag tataaacaat gagacaccag ggattagata tcagtacaat gtgcttccac 3000
agggatggaa aggatcacca gcaatattcc agtgtagcat gacaaaaatc ttagagcctt 3060
ttagaaaaca aaatccagac atagtcatct atcaatacat ggatgatttg tatgtaggat 3120
ctgacttaga aatagggcag catagaacaa aaatagagga actgagacaa catctgttga 3180
ggtggggatt taccacacca gacaaaaaac atcagaaaga acctccattc ctttggatgg 3240
gttatgaact ccatcctgat aaatggacag tacagcctat agtgctgcca gaaaaggaca 3300
gctggactgt caatgacata cagaaattag tgggaaaatt gaattgggca agtcagattt 3360
atgcagggat taaagtaagg caattatgta aacttcttag gggaaccaaa gcactaacag 3420
aagtagtacc actaacagaa gaagcagagc tagaactggc agaaaacagg gagattctaa 3480
aagaaccggt acatggagtg tattatgacc catcaaaaga cttaatagca gaaatacaga 3540
agcaggggca aggccaatgg acatatcaaa tttatcaaga gccatttaaa aatctgaaaa 3600
caggaaaata tgcaagaatg aagggtgccc acactaatga tgtgaaacaa ttaacagagg 3660
cagtacaaaa aatagccaca gaaagcatag taatatgggg aaagactcct aaatttaaat 3720
tacccataca aaaggaaaca tgggaagcat ggtggacaga gtattggcaa gccacctgga 3780
ttcctgagtg ggagtttgtc aatacccctc ccttagtgaa gttatggtac cagttagaga 3840
aagaacccat aataggagca gaaactttct atgtagatgg ggcagccaat agggaaacta 3900
aattaggaaa agcaggatat gtaactgaca gaggaagaca aaaagttgtc cccctaacgg 3960
acacaacaaa tcagaagact gagttacaag caattcatct agctttgcag gattcgggat 4020
tagaagtaaa catagtgaca gactcacaat atgcattggg aatcattcaa gcacaaccag 4080
ataagagtga atcagagtta gtcagtcaaa taatagagca gttaataaaa aaggaaaaag 4140
tctacctggc atgggtacca gcacacaaag gaattggagg aaatgaacaa gtagatgggt 4200
tggtcagtgc tggaatcagg aaagtactat ttttagatgg aatagataag gcccaagaag 4260
aacatgagaa atatcacagt aattggagag caatggctag tgattttaac ctaccacctg 4320
tagtagcaaa agaaatagta gccagctgtg ataaatgtca gctaaaaggg gaagccatgc 4380
atggacaagt agactgtagc ccaggaatat ggcagctaga ttgtacacat ttagaaggaa 4440
aagttatctt ggtagcagtt catgtagcca gtggatatat agaagcagaa gtaattccag 4500
cagagacagg gcaagaaaca gcatacttcc tcttaaaatt agcaggaaga tggccagtaa 4560
aaacagtaca tacagacaat ggcagcaatt tcaccagtac tacagttaag gccgcctgtt 4620
ggtgggcggg gatcaagcag gaatttggca ttccctacaa tccccaaagt caaggagtaa 4680
tagaatctat gaataaagaa ttaaagaaaa ttataggaca ggtaagagat caggctgaac 4740
atcttaagac agcagtacaa atggcagtat tcatccacaa ttttaaaaga aaagggggga 4800
ttggggggta cagtgcaggg gaaagaatag tagacataat agcaacagac atacaaacta 4860
aagaattaca aaaacaaatt acaaaaattc aaaattttcg ggtttattac agggacagca 4920
gagatccagt ttggaaagga ccagcaaagc tcctctggaa aggtgaaggg gcagtagtaa 4980
tacaagataa tagtgacata aaagtagtgc caagaagaaa agcaaagatc atcagggatt 5040
atggaaaaca gatggcaggt gatgattgtg tggcaagtag acaggatgag gattaacaca 5100
tggaaaagat tagtaaaaca ccatatgtat atttcaagga aagctaagga ctggttttat 5160
agacatcact atgaaagtac taatccaaaa ataagttcag aagtacacat cccactaggg 5220
gatgctaaat tagtaataac aacatattgg ggtctgcata caggagaaag agactggcat 5280
ttgggtcagg gagtctccat agaatggagg aaaaagagat atagcacaca agtagaccct 5340



gacctagcag accaactaat tcatctgcac tattttgatt gtttttcaga atctgctata 5400 
agaaatacca tattaggacg tatagttagt cctaggtgtg aatatcaagc aggacataac 5460 
aaggtaggat ctctacagta cttggcacta gcagcattaa taaaaccaaa acagataaag 5520 
ccacctttgc ctagtgttag gaaactgaca gaggacagat ggaacaagcc ccagaagacc 5580 
aagggccaca gagggagcca tacaatgaat ggacactaga gcttttagag gaacttaaga 5640 
gtgaagctgt tagacatttt cctaggatat ggctccataa cttaggacaa catatctatg 5700 
aaacttacgg ggatacttgg gcaggagtgg aagccataat aagaattctg caacaactgc 5760 
tgtttatcca tttcagaatt gggtgtcgac atagcagaat aggcgttact cgacagagga 5820
gagcaagaaa tggagccagt agatcctaga ctagagccct ggaagcatcc aggaagtcag 5880 
cctaaaactg cttgtaccaa ttgctattgt aaaaagtgtt gctttcattg ccaagtttgt 5940 
ttcatgacaa aagccttagg catctcctat ggcaggaaga agcggagaca gcgacgaaga 6000 
gctcatcaga acagtcagac tcatcaagct tctctatcaa agcagtaagt agtacatgta 6060 
atgcaaccta taatagtagc aatagtagca ttagtagtag caataataat agcaatagtt 6120 
gtgtggtcca tagtaatcat agaatatagg aaaatattaa gacaaagaaa aatagacagg 6180 
ttaattgata gactaataga aagagcagaa gacagtggca atgagagtga aggagaagta 6240 
tcagcacttg tggagatggg ggtggaaatg gggcaccatg ctccttggga tattgatgat 6300 
ctgtagtgct acagaaaaat tgtgggtcac agtctattat ggggtacctg tgtggaagga 6360 
agcaaccacc actctatttt gtgcatcaga tgctaaagca tatgatacag aggtacataa 6420 
tgtttgggcc acacatgcct gtgtacccac agaccccaac ccacaagaag tagtattggt 6480 
aaatgtgaca gaaaatttta acatgtggaa aaatgacatg gtagaacaga tgcatgagga 6540 
tataatcagt ttatgggatc aaagcctaaa gccatgtgta aaattaaccc cactctgtgt 6600 
tagtttaaag tgcactgatt tgaagaatga tactaatacc aatagtagta gcgggagaat 6660 
gataatggag aaaggagaga taaaaaactg ctctttcaat atcagcacaa gcataagaga 6720
taaggtgcag aaagaatatg cattctttta taaacttgat atagtaccaa tagataatac 6780 
cagctatagg ttgataagtt gtaacacctc agtcattaca caggcctgtc caaaggtatc 6840 
ctttgagcca attcccatac attattgtgc cccggctggt tttgcgattc taaaatgtaa 6900 
taataagacg ttcaatggaa caggaccatg tacaaatgtc agcacagtac aatgtacaca 6960 
tggaatcagg ccagtagtat caactcaact gctgttaaat ggcagtctag cagaagaaga 7020
tgtagtaatt agatctgcca atttcacaga caatgctaaa accataatag tacagctgaa 7080 
cacatctgta gaaattaatt gtacaagacc caacaacaat acaagaaaaa gtatccgtat 7140 
ccagagggga ccagggagag catttgttac aataggaaaa ataggaaata tgagacaagc 7200 
acattgtaac attagtagag caaaatggaa tgccacttta aaacagatag ctagcaaatt 7260 
aagagaacaa tttggaaata ataaaacaat aatctttaag caatcctcag gaggggaccc 7320 
agaaattgta acgcacagtt ttaattgtgg aggggaattt ttctactgta attcaacaca 7380 
actgtttaat agtacttggt ttaatagtac ttggagtact gaagggtcaa ataacactga 7440 
aggaagtgac acaatcacac tcccatgcag aataaaacaa tttataaaca tgtggcagga 7500 
agtaggaaaa gcaatgtatg cccctcccat cagtggacaa attagatgtt catcaaatat 7560 
tactgggctg ctattaacaa gagatggtgg taataacaac aatgggtccg agatcttcag 7620
acctggagga ggcgatatga gggacaattg gagaagtgaa ttatataaat ataaagtagt 7680 
aaaaattgaa ccattaggag tagcacccac caaggcaaag agaagagtgg tgcagagaga 7740 
aaaaagagca gtgggaatag gagctttgtt ccttgggttc ttgggagcag caggaagcac 7800 
tatgggctgc acgtcaatga cgctgacggt acaggccaga caattattgt ctgatatagt 7860 
gcagcagcag aacaatttgc tgagggctat tgaggcgcaa cagcatctgt tgcaactcac 7920 
agtctggggc atcaaacagc tccaggcaag aatcctggct gtggaaagat acctaaagga 7980 
tcaacagctc ctggggattt ggggttgctc tggaaaactc atttgcacca ctgctgtgcc 8040 
ttggaatgct agttggagta ataaatctct ggaacagatt tggaataaca tgacctggat 8100 
ggagtgggac agagaaatta acaattacac aagcttaata cactccttaa ttgaagaatc 8160 
gcaaaaccag caagaaaaga atgaacaaga attattggaa ttagataaat gggcaagttt 8220 
gtggaattgg tttaacataa caaattggct gtggtatata aaattattca taatgatagt 8280 
aggaggcttg gtaggtttaa gaatagtttt tgctgtactt tctatagtga atagagttag 8340 
gcagggatat tcaccattat cgtttcagac ccacctccca atcccgaggg gacccgacag 8400 
gcccgaagga atagaagaag aaggtggaga gagagacaga gacagatcca ttcgattagt 8460 
gaacggatcc ttagcactta tctgggacga tctgcggagc ctgtgcctct tcagctacca 8520 
ccgcttgaga gacttactct tgattgtaac gaggattgtg gaacttctgg gacgcagggg 8580 
gtgggaagcc ctcaaatatt ggtggaatct cctacagtat tggagtcagg aactaaagaa 8640 
tagtgctgtt aacttgctca atgccacagc catagcagta gctgagggga cagatagggt 8700
tatagaagta ttacaagcag cttatagagc tattcgccac atacctagaa gaataagaca 8760 



gggcttggaa aggattttgc tataagatgg gtggcaagtg gtcaaaaagt agtgtgattg 8820
gatggcctgc tgtaagggaa agaatgagac gagctgagcc agcagcagat ggggtgggag 8880 
cagtatctcg agacctagaa aaacatggag caatcacaag tagcaataca gcagctaaca 8940 
atgctgcttg tgcctggcta gaagcacaag aggaggaaga ggtgggtttt ccagtcacac 9000 
ctcaggtacc tttaagacca atgacttaca aggcagctgt agatcttagc cactttttaa 9060 
aagaaaaggg gggactggaa gggctaattc actcccaaag aagacaagat atccttgatc 9120 
tgtggatcta ccacacacaa ggctacttcc ctgattggca gaactacaca ccagggccag 9180 
gggtcagata tccactgacc tttggatggt gctacaagct agtaccagtt gagccagata 9240
aggtagaaga ggccaataaa ggagagaaca ccagcttgtt acaccctgtg agcctgcatg 9300 
gaatggatga ccctgagaga gaagtgttag agtggaggtt tgacagccgc ctagcatttc 9360 
atcacgtggc ccgagagctg catccggagt acttcaagaa ctgctgacat cgagcttgct 9420 
acaagggact ttccgctggg gactttccag ggaggcgtgg cctgggcggg actggggagt 9480 
ggcgagccct cagatgctgc atataagcag ctgctttttg cctgtactgg gtctctctgg 9540 
ttagaccaga tctgagcctg ggagctctct ggctaactag ggaacccact gcttaagcct 9600 
caataaagct tgccttgagt gcttcaagta gtgtgtgccc gtctgttgtg tgactctggt 9660 
aactagagat ccctcagacc cttttagtca gtgtggaaaa tctctagca 9709 



<210> 2 
<211> 854 
<212> PRT 

<213> Human immunodeficiency virus 
<220> 

<223> Envelope Polyprotein 
<400> 2 

Met Arg Val Lys Glu Lys Tyr Gln His Leu Trp Arg Trp Gly Trp Lys
1 5 10 15

Trp Gly Thr Met Leu Leu Gly Ile Leu Met Ile Cys Ser Ala Thr Glu
20 25 30

Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala
35 40 45

Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu
50 55 60

Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn
65 70 75 80

Pro Gln Glu Val Val Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp
85 90 95

Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp
100 105 110

Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Ser
115 120 125

Leu Lys Cys Thr Asp Leu Lys Asn Asp Thr Asn Thr Asn Ser Ser Ser
130 135 140

Gly Arg Met Ile Met Glu Lys Gly Glu Ile Lys Asn Cys Ser Phe Asn
145 150 155 160

Ile Ser Thr Ser Ile Arg Asp Lys Val Gln Lys Glu Tyr Ala Phe Phe
165 170 175

Tyr Lys Leu Asp Ile Val Pro Ile Asp Asn Thr Ser Tyr Arg Leu Ile
180 185 190

Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe
195 200 205

Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu
210 215 220

Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr Asn Val
225 230 235 240

Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln
245 250 255

Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Asp Val Val Ile Arg Ser
260 265 270

Ala Asn Phe Thr Asp Asn Ala Lys Thr Ile Ile Val Gln Leu Asn Thr
275 280 285

Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser
290 295 300

Ile Arg Ile Gln Arg Gly Pro Gly Arg Ala Phe Val Thr Ile Gly Lys
305 310 315 320

Ile Gly Asn Met Arg Gln Ala His Cys Asn Ile Ser Arg Ala Lys Trp
325 330 335

Asn Ala Thr Leu Lys Gln Ile Ala Ser Lys Leu Arg Glu Gln Phe Gly
340 345 350

Asn Asn Lys Thr Ile Ile Phe Lys Gln Ser Ser Gly Gly Asp Pro Glu
355 360 365

Ile Val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn
370 375 380

Ser Thr Gln Leu Phe Asn Ser Thr Trp Phe Asn Ser Thr Trp Ser Thr
385 390 395 400

Glu Gly Ser Asn Asn Thr Glu Gly Ser Asp Thr Ile Thr Leu Pro Cys
405 410 415

Arg Ile Lys Gln Phe Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met
420 425 430

Tyr Ala Pro Pro Ile Ser Gly Gln Ile Arg Cys Ser Ser Asn Ile Thr
435 440 445

Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Asn Asn Asn Gly Ser Glu
450 455 460

Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu
465 470 475 480

Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro
485 490 495

Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly
500 505 510

Ile Gly Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met
515 520 525

Gly Cys Thr Ser Met Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser
530 535 540

Asp Ile Val Gln Gln Gln Asn Asn Leu Leu Arg Ala Ile Glu Ala Gln
545 550 555 560

Gln His Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala
565 570 575

Arg Ile Leu Ala Val Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly
580 585 590

Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala Val Pro Trp
595 600 605

Asn Ala Ser Trp Ser Asn Lys Ser Leu Glu Gln Ile Trp Asn Asn Met
610 615 620

Thr Trp Met Glu Trp Asp Arg Glu Ile Asn Asn Tyr Thr Ser Leu Ile
625 630 635 640

His Ser Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Gln
645 650 655

Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn
660 665 670

Ile Thr Asn Trp Leu Trp Tyr Ile Lys Leu Phe Ile Met Ile Val Gly
675 680 685

Gly Leu Val Gly Leu Arg Ile Val Phe Ala Val Leu Ser Ile Val Asn
690 695 700

Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr His Leu Pro
705 710 715 720

Ile Pro Arg Gly Pro Asp Arg Pro Glu Gly Ile Glu Glu Glu Gly Gly
725 730 735

Glu Arg Asp Arg Asp Arg Ser Ile Arg Leu Val Asn Gly Ser Leu Ala
740 745 750

Leu Ile Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe Ser Tyr His Arg
755 760 765

Leu Arg Asp Leu Leu Leu Ile Val Thr Arg Ile Val Glu Leu Leu Gly
770 775 780

Arg Arg Gly Trp Glu Ala Leu Lys Tyr Trp Trp Asn Leu Leu Gln Tyr
785 790 795 800

Trp Ser Gln Glu Leu Lys Asn Ser Ala Val Asn Leu Leu Asn Ala Thr
805 810 815

Ala Ile Ala Val Ala Glu Gly Thr Asp Arg Val Ile Glu Val Leu Gln
820 825 830

Ala Ala Tyr Arg Ala Ile Arg His Ile Pro Arg Arg Ile Arg Gln Gly
835 840 845

Leu Glu Arg Ile Leu Leu
850



<210> 3 
<211> 107 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide for cloning 

<400> 3 

aagatgtagt aattagatct gccaatttca cagacaatgc taaaaccata atagtacagc 60 
tgaacacatc gttagaaatt aattgtacaa gacccaacaa caataca 107 



<210> 4 
<211> 120 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide for cloning 

<220> 

<221> misc_feature 
<222> (97)..(99)

<223> Sequence at this position: (GA)(AT)(GATC), i.e.
base at position 97 can be G or A, base at
position 98 can be A or T, and base at
position 99 can be G, A, T or C.

<400> 4 

ttttgctcta gaaatgttac aatgtgcttg tcttatgtct cctgttgcag cttctgttgc 60 
atgaaatgct ctccctggtc cgatatggat actatgrwnt tttcttgtat tgttgttggg 120
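
Illustrative note (not part of the filed listing): the <223> entry for positions 97..99 corresponds to the ambiguity symbols r, w and n in the sequence above. The following is a minimal Python sketch, assuming the standard IUPAC nucleotide ambiguity codes, that enumerates the concrete triplets this degenerate site encodes. The function name `expand_degenerate` is hypothetical and introduced only for illustration.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes (lower case, as in the listing).
# Only the symbols needed here plus the four plain bases are included.
IUPAC = {
    "a": "a", "c": "c", "g": "g", "t": "t",
    "r": "ga",    # G or A
    "w": "at",    # A or T
    "n": "gatc",  # G, A, T or C
}

def expand_degenerate(site):
    """Return every concrete sequence encoded by a degenerate site."""
    choices = [IUPAC[base] for base in site]
    return ["".join(combo) for combo in product(*choices)]

# Positions 97..99 of SEQ ID NO: 4 read "rwn".
variants = expand_degenerate("rwn")
print(len(variants))  # 2 * 2 * 4 combinations
print(variants)
```

Each returned triplet is one possible base combination at positions 97 to 99 of the oligonucleotide as written in the listing.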



<210> 5 

<211> 17 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of the artificial sequence 
sequencing primer 

<400> 5 

ccatgtacaa atgtcag 

<210> 6 

<211> 17 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence 
sequencing primer 

<400> 6 

aaaactgtgc gttacaa 



<210> 7 

<211> 17 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence 
sequencing primer 

<400> 7 

gtaaaacgac ggccagt 



<210> 8 

<211> 17 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence 
sequencing primer 



<400> 8 

caggaaacag ctatgac 



<210> 9 
<211> 2148 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of the artificial sequence: synthetic DNA 



<220> 

<221> misc_feature

<222> (3)..(9)

<223> BstEII cleavage site 



<220> 

<221> misc_feature
<222> (2143)..(2148)
<223> BamHI cleavage site 



<400> 9 

tgggtcaccg tctattatgg ggtgcctgtg tggaaggaag caaccaccac tctattttgt 60 

gcatcagatg ctaaagcata tgatacagag gtacataatg tttgggccac acatgcctgt 120 

gtacccacag accccaaccc acaagaagta gtattggtaa atgtgacaga aaattttaac 180 

atgtggaaaa atgacatggt agaacagatg catgaggata taatcagttt atgggatcaa 240 

agccttaagc catgtgtaaa attaacccca ctctgtgtta gtttaaagtg cactgatttg 300 

aagaatgata ctaataccaa tagtagtagc gggagaatga taatggagaa aggagagata 360

aaaaactgca gcttcaatat cagcacaagc ataagagata aggtgcagaa agaatatgca 420 

ttcttttata aacttgatat agtaccaata gataatacca gctataggtt gataagttgt 480 

aacacctcag tgatcacaca ggcctgtcca aaggtatcct ttgagccaat tcccatacat 540 

tattgtgccc cggctggttt tgcgattcta aaatgtaata ataagacgtt caatggaaca 600 

ggaccatgta caaatgtcag cacagtacaa tgtacacatg gaattcgacc agtagtatca 660 

actcaactgc tgttaaatgg cagtctagca gaagaagatg tagtaattag atctgccaat 720

ttcacagaca atgctaaaac cataatagta cagctgaaca catctgtaga aattaattgt 780 

acaagaccca acaacaatac aagaaaaagt atccgtatcc agaggggacc agggagagca 840 

tttgttacaa taggaaaaat aggaaatatg agacaagcac attgtaacat ttctagagca 900 

aaatggaatg ccactttaaa acagatagct agcaaattaa gagaacaatt tggaaataat 960 

aaaacaataa tctttaagca gtcatccgga ggggacccag aaattgtaac gcacagtttt 1020 

aattgtggag gggaattttt ctactgtaat tcaacacaac tgtttaatag tacttggttt 1080 

aatagtactt ggagtactga agggtcaaat aacactgaag gaagtgacac aatcacactc 1140 

ccatgcagaa taaaacaatt tataaacatg tggcaggaag taggaaaagc aatgtatgcc 1200

cctcccatca gtggccaaat tagatgttca tcaaatatta ctgggctgct attaactcga 1260 

gatggtggta ataacaacaa tgggtccgag attttcagac ctggaggagg cgatatgagg 1320 

gataattgga gaagtgaatt atataaatat aaagtagtaa aaattgaacc attaggagta 1380 

gcacccacca aggcaaagag acgcgtggtg cagagagaaa agcgcgcagt gggaatagga 1440 

gctctgttcc ttgggttctt gggagcagca ggaagcacta tgggcgcagc gtcaatgacg 1500 

ctgacggtac aggccagaca attattgtct gatatagtgc agcagcagaa caatttgctg 1560 

agggcaattg aggcgcaaca gcatctgttg caactcacag tctggggcat caaacagctc 1620 

caggcaagaa tcctggctgt ggaaagatac ctaaaggatc aacagctcct ggggatttgg 1680 

ggttgctctg gaaaactcat ttgcaccact gctgtgcctt ggaatgctag ttggagtaat 1740 

aaatctctgg aacagatttg gaataacatg acctggatgg agtgggacag agaaattaac 1800 

aattacacaa gcttaataca ctccttaatt gaagaatcgc aaaaccagca agaaaagaat 1860 

gaacaagaat tattggaatt agataaatgg gcaagtttgt ggaattggtt taacataaca 1920

aattggctgt ggtatataaa attattcata atgatagtag gaggcttggt aggtttaaga 1980 

atagtttttg ctgtactttc tatagtgaat agagttaggc agggatattc accattatcg 2040 

tttcagaccc acctcccaat cccgagggga cccgacaggc ccgaaggaat agaagaagaa 2100 

ggtggagaga gagacagaga cagatccatt cgattagtga acggatcc 2148 
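
Illustrative note (not part of the filed listing): the two <222> ranges annotated for SEQ ID NO: 9 can be checked against the sequence text. The sketch below assumes that <222> coordinates are 1-based and inclusive, and uses only the first and last lines of the sequence as printed above with the grouping spaces removed. The helper name `feature` and the constant `TAIL_OFFSET` are hypothetical.

```python
def feature(seq, start, end):
    """Return the 1-based, inclusive span start..end of seq."""
    return seq[start - 1:end]

# First 60 and last 48 bases of SEQ ID NO: 9, copied from the listing.
head = "tgggtcaccgtctattatggggtgcctgtgtggaaggaagcaaccaccactctattttgt"
tail = "ggtggagagagagacagagacagatccattcgattagtgaacggatcc"
TAIL_OFFSET = 2100  # the last line starts at position 2101

bsteii = feature(head, 3, 9)
bamhi = feature(tail, 2143 - TAIL_OFFSET, 2148 - TAIL_OFFSET)

print(bsteii)  # expected to fit the BstEII pattern GGTNACC
print(bamhi)   # expected to fit the BamHI pattern GGATCC
```

Under these assumptions the extracted spans are `ggtcacc` and `ggatcc`, which agree with the annotated BstEII and BamHI cleavage sites.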



<210> 10 
<211> 6229 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of the artificial sequence: synthetic DNA 
<220> 

<221> sig_peptide
<222> (1293)..(1295)
<223> env ATG

<220>

<221> misc_feature

<222> (1377)..(1379)

<223> env AGT, gp120 start

<220>

<221> misc_feature

<222> (1397)..(1403)

<223> BstEII cleavage site

<220>

<221> misc_feature
<222> (3537)..(3542)
<223> BamHI cleavage site

<220>

<221> misc_feature
<222> (3855)..(3857)
<223> env TAA, stop

<400> 10 

ctgacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga 60
ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg 120
ccacgttcgc cggctttccc cgtcaagctc taaatcgggg gctcccttta gggttccgat 180
ttagtgcttt acggcacctc gaccccaaaa aacttgatta gggtgatggt tcacgtagtg 240
ggccatcgcc ctgatagacg gtttttcgcc ctttgacgtt ggagtccacg ttctttaata 300
gtggactctt gttccaaact ggaacaacac tcaaccctat ctcggtctat tcttttgatt 360
tataagggat tttgccgatt tcggcctatt ggttaaaaaa tgagctgatt taacaaaaat 420
ttaacgcgaa ttttaacaaa atattaacgc ttacaatttc cattcgccat tcaggctgcg 480
caactgttgg gaagggcgat cggtgcgggc ctcttcgcta ttacgccagc tggcgaaagg 540
gggatgtgct gcaaggcgat taagttgggt aacgccaggg ttttcccagt cacgacgttg 600
taaaacgacg gccagtgagc gtctagttat taatagtaat caattacggg gtcattagtt 660
catagcccat atatggagtt ccgcgttaca taacttacgg taaatggccc gcctggctga 720
ccgcccaacg acccccgccc attgacgtca ataatgacgt atgttcccat agtaacgcca 780
atagggactt tccattgacg tcaatgggtg gagtatttac ggtaaactgc ccacttggca 840
gtacatcaag tgtatcatat gccaagtacg ccccctattg acgtcaatga cggtaaatgg 900
cccgcctggc attatgccca gtacatgacc ttatgggact ttcctacttg gcagtacatc 960
tacgtattag tcatcgctat taccatggtg atgcggtttt ggcagtacat caatgggcgt 1020
ggatagcggt ttgactcacg gggatttcca agtctccacc ccattgacgt caatgggagt 1080
ttgttttggc accaaaatca acgggacttt ccaaaatgtc gtaacaactc cgccccattg 1140
acgcaaatgg gcggtaggcg tgtacggtgg gaggtctata taagcagagc tcgtttagtg 1200
aaccgtcaga tcgcctggag acgccatcca cgctgttttg acctccatag aagacaccgg 1260
gacaattcga gctcggtacc gtcgacgcca ccatgagagt gaaggagaag tatcagcact 1320
tgtggagatg ggggtggaaa tggggcacca tgctccttgg gatattgatg atctgtagtg 1380
ctacagaaaa attgtgggtc accgtctatt atggggtacc tgtgtggaag gaagcaacca 1440
ccactctatt ttgtgcatca gatgctaaag catatgatac agaggtacat aatgtttggg 1500
ccacacatgc ctgtgtaccc acagacccca acccacaaga agtagtattg gtaaatgtga 1560
cagaaaattt taacatgtgg aaaaatgaca tggtagaaca gatgcatgag gatataatca 1620
gtttatggga tcaaagccta aagccatgtg taaaattaac cccactctgt gttagtttaa 1680
agtgcactga tttgaagaat gatactaata ccaatagtag tagcgggaga atgataatgg 1740
agaaaggaga gataaaaaac tgctctttca atatcagcac aagcataaga gataaggtgc 1800
agaaagaata tgcattcttt tataaacttg atatagtacc aatagataat accagctata 1860
ggttgataag ttgtaacacc tcagtcatta cacaggcctg tccaaaggta tcctttgagc 1920
caattcccat acattattgt gccccggctg gttttgcgat tctaaaatgt aataataaga 1980
cgttcaatgg aacaggacca tgtacaaatg tcagcacagt acaatgtaca catggaatca 2040
ggccagtagt atcaactcaa ctgctgttaa atggcagtct agcagaagaa gatgtagtaa 2100
ttagatctgc caatttcaca gacaatgcta aaaccataat agtacagctg aacacatctg 2160
tagaaattaa ttgtacaaga cccaacaaca atacaagaaa aagtatccgt atccagaggg 2220
gaccagggag agcatttgtt acaataggaa aaataggaaa tatgagacaa gcacattgta 2280
acattagtag agcaaaatgg aatgccactt taaaacagat agctagcaaa ttaagagaac 2340
aatttggaaa taataaaaca ataatcttta agcaatcctc aggaggggac ccagaaattg 2400
taacgcacag ttttaattgt ggaggggaat ttttctactg taattcaaca caactgttta 2460
atagtacttg gtttaatagt acttggagta ctgaagggtc aaataacact gaaggaagtg 2520
acacaatcac actcccatgc agaataaaac aatttataaa catgtggcag gaagtaggaa 2580
aagcaatgta tgcccctccc atcagtggac aaattagatg ttcatcaaat attactgggc 2640
tgctattaac aagagatggt ggtaataaca acaatgggtc cgagatcttc agacctggag 2700
gaggcgatat gagggacaat tggagaagtg aattatataa atataaagta gtaaaaattg 2760
aaccattagg agtagcaccc accaaggcaa agagaagagt ggtgcagaga gaaaaaagag 2820
cagtgggaat aggagctttg ttccttgggt tcttgggagc agcaggaagc actatgggct 2880
gcacgtcaat gacgctgacg gtacaggcca gacaattatt gtctgatata gtgcagcagc 2940
agaacaattt gctgagggct attgaggcgc aacagcatct gttgcaactc acagtctggg 3000
gcatcaaaca gctccaggca agaatcctgg ctgtggaaag atacctaaag gatcaacagc 3060
tcctggggat ttggggttgc tctggaaaac tcatttgcac cactgctgtg ccttggaatg 3120
ctagttggag taataaatct ctggaacaga tttggaataa catgacctgg atggagtggg 3180
acagagaaat taacaattac acaagcttaa tacactcctt aattgaagaa tcgcaaaacc 3240
agcaagaaaa gaatgaacaa gaattattgg aattagataa atgggcaagt ttgtggaatt 3300
ggtttaacat aacaaattgg ctgtggtata taaaattatt cataatgata gtaggaggct 3360
tggtaggttt aagaatagtt tttgctgtac tttctatagt gaatagagtt aggcagggat 3420
attcaccatt atcgtttcag acccacctcc caatcccgag gggacccgac aggcccgaag 3480
gaatagaaga agaaggtgga gagagagaca gagacagatc cattcgatta gtgaacggat 3540
ccttagcact tatctgggac gatctgcgga gcctgtgcct cttcagctac caccgcttga 3600
gagacttact cttgattgta acgaggattg tggaacttct gggacgcagg gggtgggaag 3660
ccctcaaata ttggtggaat ctcctacagt attggagtca ggaactaaag aatagtgctg 3720
ttaacttgct caatgccaca gccatagcag tagctgaggg gacagatagg gttatagaag 3780
tattacaagc agcttataga gctattcgcc acatacctag aagaataaga cagggcttgg 3840
aaaggatttt gctataagat gggtggcaag tggtcaaaaa gtagtgtgat tggatggcct 3900
gctgtaaggg aaagaatgag acgagctgag ccagcagcag atggggtggg agcagtatct 3960
cgagatctag actagaacta gcttcgatcc agacatgata agatacattg atgagtttgg 4020
acaaaccaca actagaatgc agtgaaaaaa atgctttatt tgtgaaattt gtgatgctat 4080
tgctttattt gtaaccatta taagctgcaa taaacaagtt aacaacaaca attgcattca 4140
ttttatgttt caggttcagg gggaggtgtg ggaggttttt taaagcaagt aaaacctcta 4200
caaatgtggt atggctgatt atgatcctgc ctcgcgcgtt tcggtgatga cggtgaaaac 4260
ctctgacaca tgcagctccc ggagacggtc acagcttgtc tgtaagcgga tgccgggagc 4320
agacaagccc gtcagggcgc gtcagcgggt gttggcgggt gtcggggcgc agccatgacc 4380
cagtcacgta gcgatagcgg agtgtatact ggcttaacta tgcggcatca gagcagattg 4440
tactgagagt gcaccatatg tcgggccgcg ttgctggcgt ttttccatag gctccgcccc 4500
cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta 4560



taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt tccgaccctg 4620 

ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc 4680 

tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac 4740 

gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac 4800 

ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg 4860 

aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg ctacactaga 4920

aggacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt 4980 

agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag 5040 

cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct 5100 

gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt atcaaaaagg 5160 

atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta aagtatatat 5220 

gagtaaactt ggtctgacag ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc 5280 

tgtctatttc gttcatccat agttgcctga ctccccgtcg tgtagataac tacgatacgg 5340 

gagggcttac catctggccc cagtgctgca atgataccgc gagacccacg ctcaccggct 5400 

ccagatttat cagcaataaa ccagccagcc ggaagggccg agcgcagaag tggtcctgca 5460 

actttatccg cctccatcca gtctattaat tgttgccggg aagctagagt aagtagttcg 5520 

ccagttaata gtttgcgcaa cgttgttgcc attgctacag gcatcgtggt gtcacgctcg 5580 

tcgtttggta tggcttcatt cagctccggt tcccaacgat caaggcgagt tacatgatcc 5640 

cccatgttgt gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag 5700 

ttggccgcag tgttatcact catggttatg gcagcactgc ataattctct tactgtcatg 5760 

ccatccgtaa gatgcttttc tgtgactggt gagtactcaa ccaagtcatt ctgagaatag 5820

tgtatgcggc gaccgagttg ctcttgcccg gcgtcaatac gggataatac cgcgccacat 5880 

agcagaactt taaaagtgct catcattgga aaacgttctt cggggcgaaa actctcaagg 5940 

atcttaccgc tgttgagatc cagttcgatg taacccactc gtgcacccaa ctgatcttca 6000 

gcatctttta ctttcaccag cgtttctggg tgagcaaaaa caggaaggca aaatgccgca 6060 

aaaaagggaa taagggcgac acggaaatgt tgaatactca tactcttcct ttttcaatat 6120 

tattgaagca tttatcaggg ttattgtctc atgagcggat acatatttga atgtatttag 6180 

aaaaataaac aaataggggt tccgcgcaca tttccccgaa aagtgccac 6229 



<210> 11 
<211> 860 
<212> DNA 

<213> Human immunodeficiency virus 
<220> 

<221> misc_feature
<222> (1)..(860)

<223> PI-932 original sequence V1-V2-V3-loop
<400> 11 

tgtgtaccca cagaccccaa cccacaaaag gtagtattgg aaaatgtgac agaaaatttt 60 

aacatgtgga aaaatgacat ggtagaacag atgcatgagg atataatcaa tttatgggat 120 

caaagcctaa agccatgtgt aaaactaacc ccactctgtg ttactttaaa ttgcactgat 180 

gctgatttaa attgcaataa tactgattta aattgcacta aagctaattt ggggaaaaat 240 

actcataaca atactattag tgggaaaata atagagaaag tagaaataaa aaactgctct 300 

ttcaaggtca ccacaggcat aagggataag atgcaaaaag aatatgcact tttgaataaa 360

cttgatatag taccaataga taatgataag aataatacta actttatatt gataagttgt 420 

aacacctcga ccattacaca ggcctgtcca aaggtatcct ttgagccaat tcccatacat 480 

ttttgtgccc cggctggttt tgcgattcta aagtgtaatg aaaagagtta cagtggaaaa 540 

ggaccatgta aaaatgtcag cacagtacaa tgtacacatg gaattaggcc agtagtgtca 600 

actcaactgc tgttgaatgg cagtctagca gaaaaagaag tagtaattag atctgagaat 660 

ttcacagaca atgctaaaac cataatagta cagctgaagg aatctgtaaa cattacttgt 720

ataagacccc acaacactgt aacagacagg atacatatag ggccagggag atcatttcat 780 

acaacaagaa aaataaaagg agatataaga caagcacatt gtagccttag gagaaaagat 840 



tggaataaca ctttacaaga 860



<210> 12 
<211> 870 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of the artificial sequence: PI-932
gene cassette, comprising the cleavage sites for
restriction enzymes BspTI, PstI, BclI, EcoRI,
BglII, PvuII, XbaI, NheI



<400> 12 

tgtgtaccca cagaccccaa cccacaaaag gtagtattgg aaaatgtgac agaaaatttt 60 
aacatgtgga aaaatgacat ggtagaacag atgcatgagg atataatcaa tttatgggat 120
caaagcctta agccatgtgt aaaactaacc ccactctgtg ttactttaaa ttgcactgat 180
gctgatttaa attgcaataa tactgattta aattgcacta aagctaattt ggggaaaaat 240
actcataact gcagtattag tgggaaaata atagagaaag tagaaataaa aaactgctct 300
ttcaaggtca ccacaggcat aagggataag atgcaaaaag aatatgcact tttgaataaa 360
cttgatatag taccaataga taatgataag aataatacta actttatatt gataagttgt 420
aacacctcgg tgatcacaca ggcctgtcca aaggtatcct ttgagccaat tcccatacat 480
ttttgtgccc cggctggttt tgcgattcta aagtgtaatg aaaagagtta cagtggaaaa 540
ggaccatgta aaaatgtcag cacagtacaa tgtacacatg gaattcggcc agtagtgtca 600
actcaactgc tgttgaatgg cagtctagca gaaaaagaag tagtaattag atctgagaat 660
ttcacagaca atgctaaaac cataatagta cagctgaagg aatctgtaaa cattacttgt 720
ataagacccc acaacactgt aacagacagg atacatatag ggccagggag atcatttcat 780
acaacaagaa aaataaaagg agatataaga caagcacatt gtagcctttc tagaaaagat 840
tggaataaca ctttacaaga gatagctagc 870 



<210> 13 
<211> 35 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: V3 loop 
sequence of HIV-1 patient isolate 

<400> 13 

Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile His Ile Gly Pro
1 5 10 15

Gly Arg Ala Phe Tyr Ala Thr Gly Asp Ile Ile Gly Asp Ile Arg Gln
20 25 30 



Ala His Cys 
35 



<210> 14 
<211> 35 
<212> PRT 



<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PI-903 isolate 
<400> 14 

Cys Thr Arg Pro Asn Asn Asn Thr Arg Gly Ser Ile His Ile Gly Pro
1 5 10 15

Gly Ser Thr Asn Tyr Ala Thr Gly Ala Ile Ile Gly Asp Ile Ser Gln
20 25 30 

Ala His Cys 
35 



<210> 15 
<211> 35 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PI-951 isolate 
<400> 15 

Cys Thr Arg Pro His Asn Asn Thr Arg Lys Ser Ile Asn Ile Gly Pro
1 5 10 15

Gly Arg Ala Trp Tyr Thr Thr Gly Asp Ile Ile Gly Asp Ile Arg Gln
20 25 30 

Ala His Cys 
35 



<210> 16 
<211> 35 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PI-918 isolate 
<400> 16 

Cys Thr Arg Pro Ser Asn Asn Thr Arg Lys Ser Ile His Ile Gly Pro
1 5 10 15

Gly Arg Ala Phe Tyr Ala Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln
20 25 30 

Ala His Cys 
35 



<210> 17 
<211> 35 
<212> PRT 



<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PI-970 ISOLATE 
<400> 17 

Cys Thr Arg Pro Ser Asn Asn Thr Arg Lys Ser Ile His Ile Gly Pro
1 5 10 15

Gly Arg Ala Phe Tyr Ala Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln
20 25 30 

Ala His Cys 
35 



<210> 18 
<211> 35 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PI-990 ISOLATE 
<400> 18 

Cys Thr Arg Pro Asn Asn Asn Thr Arg Arg Ser Ile Pro Ile Gly Pro
1 5 10 15

Gly Arg Ala Phe Tyr Thr Thr Gly Asp Ile Val Gly Asp Ile Arg Gln
20 25 30 

Ala His Cys 
35 



<210> 19 
<211> 35 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PI-991 ISOLATE 
<400> 19 

Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile Pro Ile Ala Pro
1 5 10 15

Gly Arg Ala Phe Tyr Ala Thr Gly Glu Ile Ile Gly Asn Ile Arg Gln
20 25 30 

Ala His Cys 
35 



<210> 20 
<211> 35 
<212> PRT 



<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: PI-952 ISOLATE 
<400> 20 

Cys Ile Arg Pro Asn Asn Asn Thr Arg Lys Arg Ile Thr Leu Gly Pro
1 5 10 15

Gly Arg Val Leu Tyr Thr Thr Gly Glu Ile Ile Gly Asp Ile Arg Lys
20 25 30 

Ala His Cys 
35 



<210> 21 
<211> 35 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PI-93 ISOLATE 
<400> 21 

Cys Ile Arg Pro His Asn Thr Val Thr Asp Arg Ile His Ile Gly Pro
1 5 10 15

Gly Arg Ser Phe His Thr Thr Arg Lys Ile Lys Gly Asp Ile Arg Gln
20 25 30 

Ala His Cys 
35 



<210> 22 
<211> 35 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PI-910 ISOLATE 
<400> 22 

Cys Thr Arg Pro Ser Ile Gln Lys Arg Arg Ser Val Arg Ile Gly Pro
1 5 10 15

Gly Arg Ser Phe Ile Ala Ala Arg Ala Ala Thr Gly Asp Ile Arg Lys
20 25 30

Ala Gln Cys
35 



<210> 23 
<211> 35 
<212> PRT 



<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PI-911 ISOLATE 
<400> 23 

Cys Thr Arg Pro Ser Ile Gln Lys Arg Arg Ser Val Arg Ile Gly Pro
1 5 10 15

Gly Arg Ser Phe Ile Ala Thr Arg Ala Ala Thr Gly Asp Ile Arg Lys
20 25 30

Ala Gln Cys
35 



<210> 24 
<211> 35 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PI-930 ISOLATE 
<400> 24 

Cys Thr Arg Pro Tyr Arg Asn Ala Lys His Arg Ile Met His Ile Gly
1 5 10 15

Pro Gly Arg Ala Phe Tyr Ala Thr Asn Val Lys Gly Asn Ile Lys Gln
20 25 30 

Ala His Cys 
35 



<210> 25 
<211> 51 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: NL4-3 V2-Loop 
<400> 25 

Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Ser Thr Ser Ile Arg Asp
1 5 10 15

Lys Val Gln Lys Glu Tyr Ala Phe Phe Tyr Lys Leu Asp Ile Val Pro
20 25 30

Ile Asp Asn Thr Ser Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile
35 40 45

Thr Gln Ala
50 



<210> 26 
<211> 59 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: NL4-3 V3-Loop 
<400> 26 

Val Gln Leu Asn Thr Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn
1 5 10 15

Asn Thr Arg Lys Ser Ile Arg Ile Gln Arg Gly Pro Gly Arg Ala Phe
20 25 30

Val Thr Ile Gly Lys Ile Gly Asn Met Arg Gln Ala His Cys Asn Ile
35 40 45

Ser Arg Ala Lys Trp Asn Ala Thr Leu Lys Gln
50 55 

<210> 27 

<211> 37 

<212> PRT 

<213> HIV-1 



<220>
<221> sequence variation in position 1
<222> (1)..(1)
<223> Xaa can be N, T, H, D, K, S, E, I, Y, or can be absent
<220>
<221> sequence variation in position 3
<222> (3)..(3)
<223> Xaa can be T, I, V, S, E, L, A, or M
<220>
<221> sequence variation in position 4
<222> (4)..(4)
<223> Xaa can be R, S, T, or V
<220>
<221> sequence variation in position 5
<222> (5)..(5)
<223> Xaa can be P, L, or T

<220>
<221> sequence variation in position 6
<222> (6)..(6)
<223> Xaa can be N, S, G, H, T, Y, D, or F
<220>
<221> sequence variation in position 7
<222> (7)..(7)
<223> Xaa can be N, T, K, D, I, E, R, H, or Y
<220>
<221> sequence variation in position 8
<222> (8)..(8)
<223> Xaa can be N, Y, K, T, I, H, S, E, Q, or can be absent
<220>
<221> sequence variation in position 9
<222> (9)..(9)
<223> Xaa can be T, K, I, R, V, A, or S
<220>
<221> sequence variation in position 10
<222> (10)..(10)
<223> Xaa can be R, S, I, K, M, G, N, Q, A, L, T, or E
<220>
<221> sequence variation in position 11
<222> (11)..(11)
<223> Xaa can be K, R, Q, S, T, N, E, L, G, or I
<220>
<221> sequence variation in position 12
<222> (12)..(12)
<223> Xaa can be S, G, R, H, D, C, K, T, V, or N
<220>
<221> sequence variation in position 13
<222> (13)..(13)
<223> Xaa can be I, V, L, M, K, R, T, F, or Y
<220>
<221> sequence variation in position 14
<222> (14)..(14)
<223> Xaa can be H, P, N, T, S, Y, R, Q, A, G, I, F, or K
<220>
<221> sequence variation in position 15
<222> (15)..(15)
<223> Xaa can be I, L, M, V, F, T, S, K, R, or Y



<220>
<221> sequence variation in position 16
<222> (16)..(16)
<223> Xaa can be G, A, E, R, Q, or T
<220>
<221> sequence variation in position 17
<222> (17)..(17)
<223> Xaa can be P, W, L, Q, F, G, S, A, M, V, R, or T
<220>
<221> sequence variation in position 18
<222> (18)..(18)
<223> Xaa can be G, R, K, E, or W
<220>
<221> sequence variation in position 19
<222> (19)..(19)
<223> Xaa can be R, Q, K, S, G, A, H, E, N, or can be absent
<220>
<221> sequence variation in position 20
<222> (20)..(20)
<223> Xaa can be A, V, T, S, R, G, K, Q, L, N, Y, or can be absent
<220>
<221> sequence variation in position 21
<222> (21)..(21)
<223> Xaa can be F, W, L, V, I, Y, S, M, C, N, or P



<220>
<221> sequence variation in position 22
<222> (22)..(22)
<223> Xaa can be Y, F, H, V, R, I, N, W, D, T, C, L, or S
<220>
<221> sequence variation in position 23
<222> (23)..(23)
<223> Xaa can be T, A, G, R, V, S, K, or can be absent
<220>
<221> sequence variation in position 24
<222> (24)..(24)
<223> Xaa can be T, A, R, I, S, P, N, K, Y, G, or can be absent
<220>
<221> sequence variation in position 25
<222> (25)..(25)
<223> Xaa can be G, E, R, D, K, A, T, Q, S, or can be absent
<220>
<221> sequence variation in position 26
<222> (26)..(26)
<223> Xaa can be E, D, Q, R, G, A, K, N, H, T, S, P, I, M, Y,
V, or can be absent
<220>
<221> sequence variation in position 27
<222> (27)..(27)
<223> Xaa can be I, V, T, L, K, H, N, R, or can be absent
<220>
<221> sequence variation in position 28
<222> (28)..(28)
<223> Xaa can be I, V, T, A, M, K, L, F, N, Q, or can be absent
<220>
<221> sequence variation in position 29
<222> (29)..(29)
<223> Xaa can be G, E, V, R, or can be absent
<220>
<221> sequence variation in position 30
<222> (30)..(30)
<223> Xaa can be D, N, E, K, G, Q, T, L, H, I, A, V, Y, or can
be absent
<220>
<221> sequence variation in position 31
<222> (31)..(31)
<223> Xaa can be I, T, V, L, M, K, R, or can be absent
<220>
<221> sequence variation in position 32
<222> (32)..(32)
<223> Xaa can be R, K, S, G, T, I, or can be absent



<220>
<221> sequence variation in position 33
<222> (33)..(33)
<223> Xaa can be Q, K, R, E, L, H, T, or A
<220>
<221> sequence variation in position 34
<222> (34)..(34)
<223> Xaa can be A, P, S, or T
<220>
<221> sequence variation in position 35
<222> (35)..(35)
<223> Xaa can be H, Y, Q, F, R, N, or A
<220>
<221> sequence variation in position 36
<222> (36)..(36)
<223> Xaa can be C, S, K, or can be absent
<220>
<221> sequence variation in position 37
<222> (37)..(37)
<223> Xaa can be N, T, I, D, S, E, K, A, V, Y, H, or can be absent



<400> 27 

Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
1 5 10 15

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
20 25 30 

Xaa Xaa Xaa Xaa Xaa 
35 



